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Abstract. In 1947 Fine obtained an expression for the number ap(ra) of bi- 
nomial coefficients on row n of Pascal's triangle that are nonzero modulo p. 
In this paper we use Kummer's theorem to generalize Fine's theorem to prime 
powers, expressing the number a^a (n) of nonzero binomial coefficients modulo 
as a sum over certain integer partitions. For fixed a, this expression can be 
rewritten to show explicit dependence on the number of occurrences of each 
subword in the base-p representation of n. 



1. Introduction 

The study of arithmetic properties of binomial coefficients has a rich history. A 
main theme is that properties of ( "J modulo a prime p are related to the base-p 
representations of n and m. Let niui^i ■ • • no be the word consisting of the standard 
base-6 digits of a nonnegative integer n. We use n and nini^i ■ ■ ■ uq interchangeably. 
We consider the base-& representation of to be the empty word e. For < m < n 
we write rn;m;_i • ■ • mo for the base-6 representation of m, where we pad with zeros 
if it is otherwise shorter than nini^i ■ ■ ■ uq. With the exception of the proposition 
in Section [21 we will take b = p to he prime. 

Two of the classic results are Kummer's theorem of 1852 [151 pages 115-116] and 
Lucas' theorem of 1878 [TB]. 

Theorem (Kummer). Let p be a prime, and let < m < n. The exponent of the 
highest power of p dividing (J^) is the number of borrows involved in subtracting m 
from n in base p. 

Theorem (Lucas). Let p be a prime, and let < m < n. Then 



m 



Let afe(n) be the number of integers < m < n such that (^j ^ mod k; that 
is, afe(n) is the number of nonzero entries on row n of Pascal's triangle modulo k. 
Let \n\w be the number of occurrences of the word w in n;n/_i ■ • • no. 

In 1899 Glaisher [8l §14] initiated the study of counting entries on row n of 
Pascal's triangle modulo k by using Lucas' theorem to determine a2(n) — 2l"li. 
The proof is simple: In order that ("J be odd, each term (^' ) in the product must 
be 1, so if rii — then mi — and if n^ = 1 then nii can be either or 1. 

In 1947 Fine [6j generalized Glaisher's result to an arbitrary prime. Fine's result 
follows from Lucas' theorem in the same way. 
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Theorem (Fine). Let p be a prime, and letn>0. The number of nonzero entries 
on row n of Pascal's triangle modulo p is 

I 

ap{n) = (n^ + 1) . 

i=0 

Note that Fine's expression may be rewritten as 

p-i 

ap{n) = l[{r + iy^\^, 

r=0 

which more directly identifies the contribution of each digit < r < p — 1. 

In Section [2] we generahze Fine's result to prime powers, obtaining a formula for 
apa{n). In Section [3] we provide an algorithm for rewriting this formula in terms of 
\n\w, as we have just done with Fine's expression. Previously, such formulas were 
only known for ai{n), ag{n), and as{n). 

We mention that one may generalize Glaisher's result in a different direction, 
namely to ask for the number ak^rin) of integers < m < n such that = r 
mod k. In this context, Fine's result is an evaluation of the sum over all nonzero 
residue classes when k = p is prime, and the main result of this paper is an evalu- 
ation of the sum 

apo,{n) = ^ Op^^rin) 

for a prime power modulus. 

There have been several studies of ak,r(n). For prime k = p, Hexel and Sachs fllj 
§5] determined a formula for ap riin) in terms of (p — l)th roots of unity, where r 
is a primitive root modulo p, and from this obtained a3,i{n) = 2l"li~^(3l"l^ + 1), 
0'3,2{n) = 2l"li^^(3l"l^ — 1), and explicit formulas for a5,,i(n) in terms of |n|i, 
\n\2, |n|3, and \n\4. Garfield and Wilf |7J provided an algorithm to compute the 
generating function Y^^=o '^p^r' {n)x'' , where again r is a primitive root. Recently, 
Amdeberhan and Stanley [21 Theorem 2.1] studied the number of coefficients equal 
to r in the nth power of a general multivariate polynomial over a finite field, where 
r is an invertible element of the field. 

In the late 1980s researchers began to consider ak.r{n) for certain prime power 
moduli k = p°'. Davis and Webb [3] gave formulas for 04^1 (n), 04^2 (^t-), and 04^3 (n) 
in terms of |n|i, |n|io, and |n|ii. Around the same time, Granville [9] showed that if 
r is odd and a G {2, 3} then a2'',r{n) is either or a power of 2. Huard, Spearman, 
and WiUiams [TH[T3] gave formulas for aQ^r{n) and as,,r{n) (and their sums, which 
we derive again below). Some of these results use a generalization of Lucas' theorem 
to prime powers found by Davis and Webb ^ . 

There has also been some general work on squares of primes. Huard, Spearman, 
and Williams [13] used the result of Hexel and Sachs to find, when p \ r and r ^ 0, 
a formula for ap2 j.(n) depending only on \n\^ for words w of length at most 2. 
Earlier, Webb [TH Theorem 3] showed if p f r then ap2^(n) does not depend only 
the subwords of length at most 2 but does depend only on the blocks of nonzero 
digits in n. However, the corollary in the next section implies that by summing 
ap2 j.(n) over all nonzero residue classes r modulo p^ the dependence on only the 
subwords of length at most 2 is achieved. 
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We would be remiss to not mention Granville's thorough survey |10| . which 
discusses many additional arithmetic aspects of binomial coefficients and provides 
another generalization of Lucas' theorem to prime powers. 

2. Generalizing Fine's theorem 

We adopt the usual conventions that an empty sum is 0, an empty product is 1, 
and there is precisely one integer partition of (namely, the empty set). 
For > 1, let 

/ \ ^fc / TT ^ ~ '^k-h \ b-wo-l 

c{wkWk-i • • • Wo = — r ■ I I — r — 

Wk + l \f^^^ Wk~h + 1 y wo + l 

The function c assigns a rational number to a word on the alphabet {0,1,...,&— 1}. 
We will see this function arise naturally in the proof of Theorem [TJ 

For a nonnegative integer 7, let Saij) be the set of integer partitions of 7 into 
at least max(0,7 — {a — 1)) parts, all of size at least 2. For example, 

^8(10) = {{6, 2, 2}, {5, 3, 2}, {4, 4, 2}, {4, 3, 3}, {4, 2, 2, 2}, {3, 3, 2, 2}, {2, 2, 2, 2, 2}}. 

Let \P\ be the number of parts in the integer partition P. Let 

J2 civHw) ■ ■ ■ c{z) 

n/P 

be the sum over the sets {v,w,...,z} of |P| nonoverlapping subwords of n = 
niUi-i ■ ■ - no such that the multiset {\v\, \w\, . . . , |z|} of subword lengths is equal 
to P. In this sum we consider two subwords rii-^ni-^^i ■ ■ ■ Uf-^ and rii^ni^^i ■ ■ ■ rif^ 
to be distinct precisely when ii or /i 7^/2, so it would be more precise (but 
more cumbersome) to say that X^n/p ^ svaa. over certain sets of pairs of indices. 
For example, if n = nc,n/^n^n2ninQ then 

^ c{v)c{w) ■ ■ ■ c{z) 

n/{3,2} 

= c{n^rnnz)c{n2ni) + c{n^rnn'i)c{ninQ) + 0(714^3712)0(711710) 
+ 0(7757x4)0(773772711) + 0(715774)0(772771770) + 0(774773)0(772771770) 

and 

^ c{v)c{w) ■ ■ ■ c{z) 

n/{2,2} 

= c{nr,n4)c{n3n2) + 0(775774)0(772771) + 0(775774)0(771770) 
+ 0(774773)0(772771) + 0(774773)0(771770) + 0(773772)0(711710)- 
We now have the notation to state the main result of the paper. 

Theorem 1. Let p be a prime, let a >0, and let 77 > 0. The number of nonzero 
entries on row n of Pascal 's triangle modulo p" is 

I I \ 2("-l) 

ap°(")= n + J2 I]c(7;)o(7i;)---o(z). 

\i=0 / 7=0 PeSa{7)n/P 
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Note that if it is convenient we may extend the sum over iS'q(7) to a sum over 
partitions including 1 if we set c{wo) — 0. 

Theorem [T] follows from the following proposition. Let b > 2, and let A„(/3) 
be the number of integers < to < n such that there are exactly (3 borrows 
involved in computing n — to in base b. Let S{'y, S) be the set of integer parti- 
tions of 7 into S parts where all parts are at least 2. For example, 5(10, 3) ~ 



Everett [S] gave a different expression for A„ {(3) as a sum over all length-Z words 
on {0, 1} with precisely /3 Is. Everett's expression is simpler to state and faster to 
compute for an explicit integer n. However, because of its high-level dependence 
on it is farther away from being able to produce formulas in terms of subword 
counts. Note that neither expression for A„(/3) relies on the base b being prime. 

Now let b — p be prime. By Rummer's theorem, (^) ^ mod precisely 
when there are fewer than a borrows when subtracting m from n in base p. There- 
fore apcin) — X]^=o^«(/^)- Substituting the expression for A„(/3)/A„(0) in the 
proposition and interchanging the two outermost sums gives the statement of the 
theorem. 

Therefore it suffices to prove the proposition. For n = nini-i ■ ■ ■ uq and to = 
TO;TO;_i ■ • ■ Too, let n' = ni-i ■ ■ - no and to' — mi-i ■ ■ ■ toq. Furthermore, let n'*-' = 
ni-i ■ ■ ■ no. 

Proof of the proposition. We first find a recurrence for by establishing the 

relationship between borrows in n — to and borrows in n' — m'. Since A„(/3) = 
when P > I, it suffices to consider (3 <l. 

It may happen that m' > n' even if m < n, so we must decide how to count 
borrows in the computation of n' — m' in this case. The standard subtraction 
algorithm produces infinitely many borrows. However, the only borrows that are 
preserved when passing from n' — to' to n — to are those up through the borrow 
from the lt\v digit in n' (which is 0). Therefore, let Bn{P) be the number of integers 
n < m < 6'+^ — 1 such that there are exactly /3 borrows up through the borrow 
from nj+i = involved in computing n — m. 

Now we write A„(/3) in terms of An' (/3) and i3„' (/3). In the computation of n — to, 
a borrow from the digit rii^i occurs if to^ > n^. Moreover, if there is a borrow from 
rii then the borrow is propagated to rii+i whenever rui > Ui — 1. Thus if to' < n' 
then there are ni + 1 choices for to; such that m < n. Similarly, if to' > n' then 
there are n; choices for mi such that m < n. Therefore 



We find a recurrence for Bn{(3) analogously: If m' < n' then there are b — ni — 1 
choices for to; such that m > n. If m' > n' then there are b — ni choices for mi 
such that m > n. In each case we gain one additional borrow, so 



{{6,2,2},{5,3,2},{4,4,2},{4,3,3}}. 



Proposition. Let b > 2, let a > 0, and let n > 0. Then 




An{l3) - {ni + l)An'{f3)+niB,Af3)- 



Bnifi) - (6 - ni - l)A^,{(3 - I) + {b - ni)B^\(3 - 1). 
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Iteratively substituting the equation for B„{j5) into the equation for An{j3) until 
we reach (0) = produces the recurrence 

An{p) = {ni + l)An'{l5) + Y.'^i (n(^-"'-^)) {b-ni-i-l)A^^,+.){p-i) 

i=l \3 = 1 J 

for n > 1 and < /3 < /. Divide both sides of this recurrence by ^„(0), which (as 
in Fine's theorem) is 111=0 + 1)' obtain 

where c{wkWk-i ■ • -wq) is as defined above. Replacing n with n'-^^ in this equation 
and summing over < j < I — P causes the left side to telescope, and we see that 

An{P) A„a-f3+i){P) v^v^ / N An(j+i+i){^ - i) 

If /3 > 1 then A^(,-f,+i) {p) = 0, and if ^ = then = ^^(0) = 1. 

Wc now verify that the expression for A„(/3)/j4„(0) given in the statement of 
the proposition satisfies this recurrence and the correct boundary conditions. The 
boundary conditions are easily checked; for /3 = the expression is 1, and for /3 > Z 
it is 0. After substituting, the right side of the recurrence is 

l-P p 2(/3-i) 

^^c{ni-yni-j-i) XI X] X c{v)c{w) ■ ■ ■ c{z) 

j=0 i=l 7=/3-» PeS{^,i-l3+i) /P 

= XX X X X c{ni-j---ni-j-i)c{v)c{w)---c{z) 

j=0 i=l 7=0+1 PeS(7-l-i, 7-1-/3) n(3+»+i)/P 

= X X X X X c{ni-j---ni-j-i)c{v)c{w)---c{z) 

7=0+1 i=l j=o PGS(7-l-i, 7-1-/3) n(J+*+i)/P 

after shifting 7 i—> 7 — 1 — z and interchanging the sums over i and 7. 

Momentarily fix ^ + 1 < 7 < 2/3. For each l<i<2/3 + l — 7, take each integer 
composition of 7 — 1 — i into 7 — 1 — /3 parts, where all parts are at least 2, and 
prepend z + 1 to get a composition of 7 into 7 — /3 parts at least 2. In doing this 
we form each composition of 7 into 7-/3 parts at least 2 precisely once. Therefore 

20+1-7 

X X X X c(n;_j ■ • ■ n;_j_i)c(i;)c(M;) • • • c(z) 

i=l j=0 Pe5(7-l-'i,7-l-0) n(3+» + i) /P 
PeS(7,7-/3) n/P 

The right side of the recurrence then becomes 

2/3 

7=/3+l PeS(7,7-/3) n/P 
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which if /3 > 1 is equal to 

E E E ■ • ■ 

7=/3PeS(7,7-/3)n/P 

and if /3 = is equal to 

2/3 

E E E^(")^H---^w-i = o 

7=/3PeS(7,7-/3)n/P 

as desired. □ 

We mention that the recurrences appearing early in the proof are sufficient to 
compute ttpo (n) symbolically for fixed a; for example, for /3 = we have 

ArM = (n/ + l)A„,(0), 

giving Fine's theorem 

I 

apin) = A„(0) = J]^ (n, + 1) . 

i=0 

Of course, Fine's theorem also follows from the full statement of Theorem[T] 6*1(0) = 
{{}}, so the inner sum is a sum over one term, and the summand is the empty 
product. 

For a = 2 we have 62(0) = {{}}, S'2(l) = {}, and 62(2) = {{2}}, so 



ap2 (n) 



fri(-. + i))-(Ei+E^H| 

V=0 / \„/{} „/{2} / 



The second sum is simply the sum over all subwords of length 2, so we have proved 
the following corollary. 

Corollary. Let p he a prime, and let n > 0. The number of nonzero entries on 
row n of Pascal's triangle modulo p^ is 

/I \ / i-i -, \ 



\z— / \ 1—0 / 

Brief words are in order regarding how one can experimentally guess the general 
expression for A„(/3)/A„(0) given by the proposition once one knows the recurrence 

(13) , , A„(,+,,„(/3-») 

It is clear from this recurrence that the fully resolved expression for A„ (/3)/74„ (0) 
is a sum of terms of the form c{v)c{w) • ■ • c{z), where {v,w, . . . , z) is a tuple of 
nonoverlapping subwords of n. For example, if n = n5n4n3n2nino then 

An{3)/An{0) = c(n5n4n3n2) + c{nin^n2ni) + c{n3n2nino) 

+ c{n5n4n^)c{n2ni) + c(n5ri4n3)c(nino) + c(n4n3n2)c(nino) 
+ c(n5n4)c(n3n2ni) + c{n^n4)c{n2ninQ) + c(n4n3)c(n2nino) 

+ c(n5n4)c(n3n2)c(nino). 
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Moreover, each tuple appears at most once. Thus it suffices to determine which 
tuples appear. 

Upon explicitly computing A„(3)/A„(0) and several additional values, one ob- 
serves that if a tuple {v,w,...,z) appears in A„(/3)/A„(0) and {v,w, . . . , z) is 
a tuple of nonoverlapping subwords of the same length such that the multisets 
\w\, . . . , |z|} and {\v\, \w\, . . . ^\z\} are equal, then (w, ?i, . . . , z) also seems to ap- 
pear, regardless of the order that the subwords of either tuple occur in n. For exam- 
ple, all pairs of nonoverlapping subwords with lengths {3, 2} appear in A„(3)/A„(0). 
So presumably it suffices to determine which multisets of subword lengths appear 
for a given 13 (and I). From the data, one guesses that the multisets are certain 
integer partitions of integers 7 into 7 — /? parts, hence the proposition. For example, 
the set of partitions appearing in A„(3)/A„(0) is {{4}, {3, 2}, {2, 2, 2}}. 



3. Expressions in terms of \n\^ 

In this section we describe, for fixed a, how to rewrite the expression for Opc (n) 
of the previous section to show explicit dependence on the subword counts for 
symbolic n. Note that if w begins with or ends with p — 1 then c{w) = 0, so 
apa{n) does not depend on |n|u,. 

In Section [1] we did this for Fine's theorem (where a = 1). For a = 2 it is also 
done easily; collecting identical terms of the sum appearing in the corollary yields 

\«;o=0 / \ wi=Owo=0 i u / 

Now any explicit prime can be substituted to produce a formula. For example, 
p = 2 gives ai{n) = 2l"li(l -I- i|n|io). For p — 3 we have 

agin) = 2l"li3l"l^ (^1 + |?i|io + ^\n\n + ^\n\2o + ^\n\2i 
(first found by Huard, Spearman, and Williams |12)). for p — 5 we have 

- 1 + 2|n|io + -\n\ii + -\n\12 + -\n\i3 



2l"ll3l"l24l"l35|n|4 ^ I |iU i ^1 111 I g 

8 4 1 9 13 

+ o l"-l20 + |n-|2i + 7:|^^|22 + « I'^ba + 3|n|3o + -\n\31 + -\n\32 + 7^l"-|33 
3 9 6 8 2 16 

16, , 6, , 8 , , 1, , 
+ y fUo + -^Hii + Y^F|42 + -^\n\i3, 

and so on. To give a very explicit example, the base-5 representation of 1947 is 
30242, so 025(1947) = 32 ■ 41 • 51 • (1 -I- 3 + 8/15) 816. 
For a = 3 the theorem provides 

7^ - E 1 + E ^(-) + E ^(-) + E ^(-)^(-)- 

' n/{} n/{2} n/{3} n/{2,2} 

The first three sums can be directly rewritten in terms of |n|^. The final sum 
over nonoverlapping pairs of length-2 subwords can be written as the sum over 
unrestricted pairs of subwords minus the sum over overlapping pairs of subwords 
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(of which there are two kinds — overlapping in one letter and overlapping in both) : 

n/{2,2} 

l-l l-l 1-2 l-l 

= ^ ^ c{ni+ini)c{nj+inj) - 2 ^ c(ni+2»^i+i)c(ni+in,;) - ^ c{ni+ini)'^ . 

i=0 j=0 i=0 4=0 

The coefficients take care of symmetries among the subword lengths. Since each 
of these three new sums consists of sums over the entire word length, they can be 
rewritten to show the dependence on subwords of lengths 2 and 3 as 



c{v)c{w)\n\y\n\jjj ~ 2 ^ c{w2Wi)c{wiWo)\n\w - ^ c{w) 



where [p] = {0, 1, . . . , p — 1} is the alphabet of base-p digits. Thus we can write out 
an expression for aps^n) in terms of Letting p = 2 in this expression gives 



as{n) = 2l"li h + -|n|io + |n|ioo + ^|n|iio + -\n\ 



which was obtained by Huard, Spearman, and Williams [14] . Formulas for other 
primes can be found similarly: p = 3 gives 

a27in) 1,, 7., 4 5. 

= 1 + 7tI"|iO + TTTTFIh + 7tI"|20 + TT7l"-|21 



2l"li3l"l2 2' ' 32' ' 9' ' 18' 

3 3 3 1 1 

+ 3|n|ioo + tI'^Iioi + tI^Iiio + T^l^lin + o I'^bo + t^I»^|i21 

4 4 lb 3 12 

14 1 

+ 4|n|200 + I'^boi + |«|210 + ^I'^bu + g |"-|220 + g l"-|221 

+ ^I'^lio + ^l"|io|n|ii + ^|n|io|n|2o + ^|n|io|n|2i + ^l"lii 

+ ^Inliilj^bo + ^I'^liil^bi + ^Hlo + ^|n|2o|»^|2i + Ys^^Ali- 

For a general a we will need to be able to rewrite J2n/p c{v)c{w) ■ ■ ■ c{z) — the 
sum over subwords with overlaps — in terms of |n|^ for any given partition P. 
To do this we can use inclusion-exclusion to express this sum as sums over sets of 
words with forced overlap conditions rather than restrictive overlap conditions: 

^c(v)c(u))---c(z) = ^(-1)* c{v)c{w)---c{z). 

n/P i>0 ways to guarantee 

i overlaps 

Now each term is a sum over all sets of subwords of the desired lengths where 
certain pairs of subwords are required to overlap. For such a sum, determine the 
"connected components" induced by these pairs, and for each connected component 
find all clusters of the subwords in which the required pairs overlap. Then allow 
each connected component to range independently over the entire word n. Since 
each sum now is over the entire word n, the expression can readily be rewritten in 
terms of \n\j^. 

An implementation of this procedure is available in the Mathematica package 
BinomialCoefficients [17] ■ This implementation produces formulas for aiQ{n), 
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032 (n-), and 064 (n) fairly quickly on a standard machine (in less than a minute for 
aeiin))- For example, 

aie(n) 5.. 1,. 1.. .. 1.. 1.. 

2|„|, = 1 + Y2 ^ 2'*^'^°° gFliio + 2|n|iooo + 21*^11010 + 3 

+ ^|«|iiio + Y^Hio + ^l^liol^ilioo + ^|ri|io|n|iio + ^I'^lio- 

For slightly larger powers of 2, the expressions do not become unmanageably large, 
but the running time of the computation does grow quickly because of the many 
ways to force i overlaps. Computing 0128 (^^) took an hour and a half. After perform- 
ing these computations, the author was made aware of Everett's work 0, which 
raises the possibility of using Everett's expression for A„(/?) to compute formulas 
for CpQ (n) more quickly. This question has not been investigated, although the 
results would be interesting to know. 

For an integer partition P, rewriting J2n/p c{v)c{w) ■ ■ ■ c{z) as described yields 
a multivariate polynomial in for various words w. Therefore apa[n)/ap{n) is 
also a polynomial in \n\w. For 2 < 7 < 2{a — 1), the longest partition in Saij) 
is {3, 2, ... , 2} or {2, 2, ... , 2} and has length [7/2J , so the degree of Opc {n)/ap{n) 
is a — 1. Moreover, the longest clusters occurring for a given a have length a, so 
apa{n)/ap(n) depends only on |n|^ for words w of length at most a. 

Theorem 2. Let p be a prime, and let a > 1. Then apa{n)/ap{n) is a polynomial 
of degree a — 1 in \n\w for \w\ < a. 

It would be nice to know more about these polynomials: How does the number 
of terms grow? What can be said about the coefficients? In particular, why are 
the coefficients always nonnegative? Can any sense be made of them as series 
expansions for (n + 1)/ ap{n) if we fix p and let a — >■ 00? For example, the coefficient 
of |n|io in a2a(n)/2l"li for a = 1,2, . . . takes on the values 

1 3 5 77 391 259 
'2' 8' 12' 192' 960' 640'"'' 
and a plot of these values suggests that the limit of this sequence exists. 

For any w it is known that \n\w is a p- regular sequence in the sense of AUouche 
and Shallit \V\ Theorem 6.1]. That is, \n\w is determined by a finite set of linear 
recurrences in \p^n + i\w along with finitely many initial conditions. From closure 
properties of p-regular sequences it follows from Theorem [2] that apa{n) is also p- 
regular. Experimental evidence suggests that the rank of apc. (n) — the minimal 
number of initial conditions required — is 2a — 1. We leave this as another open 
problem. 
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